Graying the black box: Understanding DQNs

Authors

  • Tom Zahavy
  • Nir Ben-Zrihem
  • Shie Mannor
Abstract

In recent years there has been growing interest in using deep representations for reinforcement learning. In this paper, we present a methodology and tools to analyze Deep Q-networks (DQNs) in a non-blind manner. Using our tools, we reveal that the features learned by DQNs aggregate the state space in a hierarchical fashion, which helps explain their success. Moreover, we are able to understand and describe the policies learned by DQNs for three different Atari 2600 games, and we suggest ways to interpret, debug, and optimize deep neural networks in reinforcement learning.

Publication date: 2016